Deformable Part-based Fully Convolutional Network for Object Detection
Authors
Abstract
Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those objects are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to the shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, which simultaneously provides more invariance for classification and geometric information for refining localization. DP-FCN is composed of three main modules: a Fully Convolutional Network to efficiently maintain spatial resolution, a deformable part-based RoI pooling layer to optimize the positions of parts and build invariance, and a deformation-aware localization module that explicitly exploits the displacements of parts to improve the accuracy of bounding box regression. We experimentally validate our model and show significant gains. DP-FCN achieves state-of-the-art performance of 83.1% and 80.9% on PASCAL VOC 2007 and 2012, respectively, with VOC data only.
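The idea of a deformable part-based RoI pooling layer can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `deformable_part_pool`, the single shared score map, the exhaustive search over integer shifts, and the quadratic penalty with weight `lam` are all assumptions made for illustration. Each part of a k x k grid over the RoI is allowed to shift by a few cells, and the shift with the best penalized average score is kept.

```python
import numpy as np

def deformable_part_pool(score_map, roi, k=2, max_disp=2, lam=0.1):
    """Pool a k x k grid of parts from a 2-D score map (illustrative only).

    roi = (x0, y0, x1, y1) in feature-map cells, end-exclusive.
    Returns (pooled, offsets): pooled[i, j] is the best penalized score of
    part (i, j); offsets[i, j] is the (dx, dy) shift that achieved it.
    """
    H, W = score_map.shape
    x0, y0, x1, y1 = roi
    # Bin edges of the undeformed k x k grid.
    xs = np.linspace(x0, x1, k + 1).round().astype(int)
    ys = np.linspace(y0, y1, k + 1).round().astype(int)
    pooled = np.full((k, k), -np.inf)
    offsets = np.zeros((k, k, 2), dtype=int)
    for i in range(k):          # part row
        for j in range(k):      # part column
            for dy in range(-max_disp, max_disp + 1):
                for dx in range(-max_disp, max_disp + 1):
                    top, bottom = ys[i] + dy, ys[i + 1] + dy
                    left, right = xs[j] + dx, xs[j + 1] + dx
                    # Skip empty bins and shifts that leave the map.
                    if bottom <= top or right <= left:
                        continue
                    if top < 0 or left < 0 or bottom > H or right > W:
                        continue
                    score = score_map[top:bottom, left:right].mean()
                    # Assumed quadratic deformation cost: large shifts
                    # must be justified by a higher pooled score.
                    score -= lam * (dx * dx + dy * dy)
                    if score > pooled[i, j]:
                        pooled[i, j] = score
                        offsets[i, j] = (dx, dy)
    return pooled, offsets

# Usage: a response blob sits slightly up and left of the top-left part.
score_map = np.zeros((8, 8))
score_map[1:3, 1:3] = 1.0
pooled, offsets = deformable_part_pool(score_map, roi=(2, 2, 6, 6))
print(offsets[0, 0], pooled[0, 0])
```

In this sketch the returned `offsets` play the role of the part displacements that, according to the abstract, the deformation-aware localization module exploits for bounding box regression. How those displacements are actually consumed is not specified here.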
Similar resources
Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery
Convolutional neural networks (CNNs) have demonstrated their ability for object detection in very high resolution remote sensing images. However, CNNs have obvious limitations for modeling geometric variations in remote sensing targets. In this paper, we introduced a CNN structure, namely deformable ConvNet, to address geometric modeling in object recognition. By adding offsets to the convolution l...
Convolutional Gating Network for Object Tracking
Object tracking through multiple cameras is a popular research topic in security and surveillance systems especially when human objects are the target. However, occlusion is one of the challenging problems for the tracking process. This paper proposes a multiple-camera-based cooperative tracking method to overcome the occlusion problem. The paper presents a new model for combining convolutiona...
Region-based Deformable Fully Convolutional Networks for Multi-class Object Detection at Signalized Traffic Intersections
Multi-class object detection is critical for intelligent traffic monitoring applications in smart cities as well as connected autonomous vehicles. Although numerous research works evaluate the performance of image processing algorithms for on-vehicle cameras, the body of research evaluating the performance of image processing for stationary cameras located near intersections is limited. In this rese...
A Two-Dimensional Convolutional Neural Network for Brain Tumor Detection From MRI
Aims: Cancerous brain tumors are among the most dangerous diseases that lower the quality of life of people for many years. Their detection in the early stages paves the way for the proper treatment. The present study aimed to present a two-dimensional Convolutional Neural Network (CNN) for detecting brain tumors under Magnetic Resonance Imaging (MRI) using the deep learning method. Methods & ...
Object Detection with Mask-based Feature Encoding
Region-based Convolutional Neural Networks (R-CNNs) have achieved great success in the field of object detection. The existing R-CNNs usually divide a Region-of-Interest (ROI) into grids, and then localize objects by utilizing the spatial information reflected by the relative position of each grid in the ROI. In this paper, we propose a novel feature-encoding approach, where spatial information ...
Journal: CoRR
Volume: abs/1707.06175
Publication year: 2017